rank | frequency | n-gram |
---|---|---|
1 | 15211 | -s |
2 | 5719 | -a |
3 | 5005 | -ė |
4 | 3108 | -u |
5 | 2340 | -o |
rank | frequency | n-gram |
---|---|---|
1 | 2265 | -os |
2 | 2126 | -ės |
3 | 1843 | -as |
4 | 1653 | -is |
5 | 1217 | -es |
rank | frequency | n-gram |
---|---|---|
1 | 660 | -uos |
2 | 647 | -ėjė |
3 | 556 | -jės |
4 | 456 | -ėma |
5 | 454 | -nės |
rank | frequency | n-gram |
---|---|---|
1 | 501 | -ėjės |
2 | 298 | -ėnės |
3 | 212 | -ėnis |
4 | 197 | -ėniu |
5 | 187 | -ėjuo |
rank | frequency | n-gram |
---|---|---|
1 | 124 | -cėjės |
2 | 85 | -acėjė |
3 | 79 | -tėnės |
4 | 76 | -ninks |
5 | 72 | -avėma |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings